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Abstract 

We investigate decompositions of a graph into a small number of low diameter subgraphs. Let 
P{n, e, d) be the smallest k such that every graph G = (V, E) on n vertices has an edge partition 
E = Eq U El U . . . U Ek such that \Eo\ < ev? and for all 1 < i < fc the diameter of the subgraph 
spanned by Ei is at most d. Using Szemeredi's regularity lemma, Polcyn and Rucihski showed 
that P(7i, e, 4) is bounded above by a constant depending only e. This shows that every dense 
graph can be partitioned into a small number of "small worlds" provided that few edges can be 
ignored. Improving on their result, wc determine e, d) within an absolute constant factor, 
showing that P(n, e,2) = Q{n) is unbounded for e < 1/4, P(n, e,3) = 0(l/e^) for e > n~^/^ and 
P{n, e, 4) = 8(l/e) for e > n~^. We also prove that if G has large minimum degree, all the edges of 
G can be covered by a small number of low diameter subgraphs. Finally, we extend some of these 
results to hypergraphs, improving earlier work of Polcyn, Rodl, Rucihski, and Szcmercdi. 

1 Introduction 

The distance between two vertices of a graph is the length of the shortest path between them. The 
diameter diam(G) of a connected graph G = (V, E) is the maximum distance between any pair of 
vertices of the graph. If G is not connected, diam(G) = oo. For an edge subset E' C -E, the diameter 
of E' is the diameter of the subgraph of G with edge set whose vertex set consists of all the vertices 
of G which belong to at least one edge of E' . 

Extremal problems on the diameter of graphs have a long history and were first investigated by Erdos 
and Renyi [5] and Erdos, Renyi, and Sos [6], who studied the minimum number of edges in an n- vertex 
graph with diameter at most d. Another line of research concerning the change of diameter if edges 
of the graph are added or deleted was initiated by Chung and Garey [2] (see also, e.g., |3l[l]). In this 
paper, we investigate decompositions of a graph into a small number of subgraphs of low diameter. A 
motivation for such decompositions comes from distributed computing. We are given a set of processors 
(vertices) with communication channels (edges) between pairs of processors. A fundamental problem 
when designing algorithms on such systems is determining how much coordination must be done 
between the processors and accomplishing this coordination as efficiently as possible. The simplest 
approach is to centralize the network by appointing one processor to coordinate the actions of the 
network. This approach often simplifies the problem and leads to distributed algorithms based on 
known serial algorithms. However, if the network has large diameter, such rigid centralization can 
degrade system performance due to delays in communication. One solution to this problem is to 
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partition the network into regions of low diameter. This approach was used for example by Linial and 
Saks [9] who showed that every graph on n vertices can be vertex partitioned into O(logn) induced 
subgraphs whose connected components have diameter O(logn). 

In this paper we will consider another variant of the low diameter decomposition problem, which 
was also studied by several researchers. In this problem the goal is to partition nearly all the edges 
of a graph into a small number of low diameter subgraphs. More precisely we study the following 
parameter. 

Definition 1 Let P{n, e, d) be the smallest I such that every graph G = (V, E) on n vertices has an 
edge partition E = Eq U Ei U . . . U Eg such that \Eq\ < er? and the diameter of Ei is at most d for 
\<i<l. 

Polcyn and Rucinski |12| recently showed that every dense graph can be partitioned into a small 
number of "small worlds" provided that a small fraction of the edges can be ignored. Specifically, they 
proved that -P(n, e, 4) is bounded by a constant depending only on e. Their proof relies on Szemeredi's 
regularity lemma and consequently gives an enormous upper bound on P(n, e, 4) as a function of e, 
i.e., it shows that -P(ra, e,4) can be bounded from above by a tower of 2s of height polynomial in 1/e. 
One of our main results determines -P(n, e, d) up to a constant factor. It improves on the result of 
Polcyn and Rucinski both on the diameter bound and on the number of parts. 

Theorem 1 (a) For e < 1/4 hounded away from 1/4, we have P{n,e,2) = Q{n). 
(h) Fore> n-^/^ we have P(n,e,3) = e{l/e'^). 
(c) For e > , we have P{n, e, 4) = 0(l/e). 

There is a sharp transition in the behavior of the function P{n, e, 2) at e = 1/4, namely P{n, 1/4, 2)= 
1. Note also that P{n,e, 1) is the minimum number of edge-disjoint cliques needed to cover all but 
en^ edges in any graph on n vertices. This parameter is not so interesting to study as for e not too 
large, P{n, e, 1) is quadratic in n by considering the complete bipartite graph with parts of equal size. 

Extremal problems on the diameter of graphs with large minimum degree have also been studied. 
For example, Erdos et al. [1], answering a question of Gallai, determine up to an additive constant 
the largest possible diameter of a connected graph with a given number n of vertices and minimum 
degree 6. The answer is within an additive constant of ^^j-- ^'^^ graphs with large minimum degree, 
we can show that all edges of such graphs can be covered by a small number of low diameter subgraphs 
(which are not necessarily edge-disjoint). 

Definition 2 Let Q{n,e,d) be the minimum i such that the edges of any graph G = {V,E) on n 
vertices with minimum degree at least en can he covered by I sets E = Ei L) . . . L) Ei such that the 
diameter of each Ei is at most d. 

We prove the following two results, in which we use the properties of Kneser graphs to establish 
lower bounds. 

Theorem 2 (a) For fixed < e < , both Q{n,e,3) and Q[n,e,4:) have order of magnitude Q {log n). 
(h) Q(n,e,5) = Q{l/e^) andQ{n,e,6) = e(l/e). 
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Note that any graph on n vertices of minimum degree more than § — 1 has diameter at most 2, 
since non-adjacent vertices must have a common neighbor. In sharp contrast with Theorem [2]^ a), this 
shows that Q{n, 1/2, 2) = 1. 

An analogous problem for hypergraphs was first investigated by Polcyn, Rodl, Rucihski, and Sze- 
meredi jllj . A hypergraph G = {V,E) is k-uniform if each edge has exactly k vertices. A (tight) path 
of length ^ in a /c-uniform hypergraph G = {V, E) is a subhypergraph consisting of £ + — 1 vertices 
ui, . . . , f^+fc-i and I edges, such that for each i < I, (uj, fj+i, . . . , f is an edge of G. The vertices 
vi and V£j^k~i are the endpoints of the path. The distance between two vertices v,w \n a. hypergraph 
is the length of the shortest path whose endpoints are v and w. The diameter diam(G) of G is the 
maximum distance between any two vertices of G. 

Definition 3 Let Pk{n,€,d) be the smallest i such that every k-uniform hypergraph G = (y,E) on n 
vertices has an edge partition E = Eq U Ei U . . . U E^ such that \Eq\ < en^ and the diameter of E^ is 
at most d for 1 < i < k. 

Polcyn et al. [11] showed that ^3(71, e, 12) is bounded above by a constant C^^e) depending only on 
e. Their proof uses the hypergraph regularity lemma, and gives an Ackermann-type upper bound on 
C3(e). Here we improve the diameter bound from 12 to 3, which is best possible, generalize it to any 
uniformity k, and further show that this function is polynomial in e~^. The proof uses similar counting 
arguments as done in the graph case. 



We study P{n, e, d) in the next section, where we consider the cases (i = 2, 3, 4 in three separate 
subsections. In Section [3l we study edge partitions of graphs of large minimum degree into a small 
number of low diameter subgraphs. In SectionUl we prove bounds on Pk{n, e, d). The last section of the 
paper contains some concluding remarks and open questions. Throughout the paper, we systematically 
omit floor and ceiling signs whenever they are not crucial for the sake of clarity of presentation. We 
also do not make any serious attempt to optimize absolute constants in our statements and proofs. 
All logarithms in this paper are in base 2. 

2 Proof of Theorems [1] 

In this section, we prove bounds on P(n, e, d). We consider the cases d = 2, 3, 4 in separate subsections. 
2.1 Decomposing a graph into diameter 2 subgraphs 

In this subsection we prove Theorem [T]J^a), which states that if e < 1/4 is bounded away from 1/4, then 
P(n, e,2) = G(n). The proofs for both the upper bound and for the lower bound are quite simple. 
We first show the upper bound. Let G be a graph with n vertices. Note that a star has diameter 2. 
Letting Ei be those edges that contain the ith vertex of G, we have P{n, 0, 2) < n. 

To prove the lower bound we use the following simple fact. If G is a bipartite graph, then any 
subgraph of G which is not complete has diameter at least 3. We show next that if G is a random 



Theorem 3 We have Pfe(n, e, 3) = O ie 




In the other direction, we show that Pk{n,e,3) > c^.e 



for e ^ n 



which we think is tight. 
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bipartite graph of edge density bounded away from 1, then almost surely every complete bipartite 
subgraph of G has 0{n) edges. The random bipartite graph G{n, n,p) is the probability space of labeled 
bipartite graphs with n vertices in each class, where each of the edges appears independently with 
probability p. The term almost surely means with probability tending to 1 as n tends to infinity. 

Lemma 1 Fix < p < 1 and let q = 1 — p- Almost surely, all complete bipartite subgraphs of the 
random bipartite graph G = G{n,n,p) have at most 2n/q edges. 

Proof: Let A and B be the two vertex classes of G. The probability that there is S" C vl and T C B 
that are complete to each other and have at least 2n/q edges between them is at most 

□ 

The Chernoff bound for the binomial distribution implies that the random graph G{n,n,p) almost 
surely has pn? + o(n^) edges. Let e < q = j — e and p = | + e. By considering G{n/2, n/2,p), we 
have that there is a bipartite graph G on n vertices with at least (p— o(l))n^/4 = (e+^ — o(l))n^ edges 
and such that every diameter 2 subgraph of G has at most 2n/q edges. To cover — o(l))n^ > qn'^/2 

edges of G by diameter 2 subgraphs, we need to use at least = q'^n/A subgraphs. Hence 

P{n, e, 2) > q^n/A = for n sufficiently large. This completes the proof of Theorem [1]^ a). □ 

We conclude this subsection by showing that a sharp transition for P{n,e,2) occurs at e = 1/4. 
Namely, P(n, 1/4, 2) = 1. If a graph G has a vertex which is adjacent to less than half of the other 
vertices, delete it. Continue deleting vertices until all vertices in the remaining induced subgraph Gi 
are adjacent to at least half of the other vertices of Gi. Graph Gi has diameter at most 2 by the 
discussion after Theorem [21 Let Gq be the subgraph whose edges are those containing a deleted vertex. 
Then Gq has at most ^(2) < n^/4 = en^ edges and therefore P{n, 1/4, 2) = 1. 

2.2 Decomposing a graph into diameter 3 subgraphs 

Studying P(n, e, d) appears to be most interesting in the case d = 3. In this subsection, we prove 
Theorem [TJ^b), which establishes P{n,e,3) = 0(l/e^) for e > n^^/^. We begin with a few simple 
lemmas. 

3 

Lemma 2 If G is a graph with n vertices and m > 12n edges, then G has at least paths of length 
three. 

Proof: Delete from G vertices of degree at most ^ one by one. The resulting induced subgraph G' 
has at least m — = m/2 edges, and has minimum degree at least For each edge e = {u,v) of 
G' , u and v each have at least ^ neighbors, so the number of paths of length three with middle edge e 

2 

is at least — 1)(^ — 2)> ^ as there are at least ^ — 1 possible choices for the first vertex of the 
path, and, given the first three vertices of the path, at least ^ — 2 remaining possible last vertices for 
the path. Counting over all m/2 possible middle edges, we obtain that the number of paths of length 

2 3 

three in G' (and hence in G) is at least = j^- ^ 

The next definition demonstrates how to construct a subgraph of diameter at most 3 from a graph 
and a pair of its vertices. 
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Definition 4 For a graph G and vertices v and w of distance at most d, let Gd{v,w) be the induced 
subgraph of G consisting of all vertices of G that lie on a walk from v to w of length at most d. 

Lemma 3 The graph G(i{v,w) has diameter at most d. 

Proof: Let a and b be vertices of Gd{v, w). So a is on a walk from v to i(; of lengtli at most d, and b 
is on a walk from v to w oi length at most d. These two walks give rise to two walks from a to 6 such 
that the sum of the lengths of these two walks is at most 2d. Hence, there is a path from a to 6 of 
length at most d. This shows that Gd{v, w) has diameter at most d. □ 

Lemma 4 If a graph G has n vertices and m > 12n edges, then it contains an induced subgraph H 

3 

with at least edges that has diameter at most 3. 

Proof: By Lemma O G has at least paths of length three. By averaging, there is a pair u,v of 
vertices of G such that the number of paths of length three with terminal vertices u and v is at least 

3 1—1 

By Lemma [3l G^{u,v) has diameter at most 3 and in every paths of length three from u to v, 
the middle edge is an edge of G^{u, v). Moreover, each edge in Gs{u, v) is the middle edge of at most 

2 

two paths from u to v, hence G3{u,v) has at least edges. □ 
We now prove a quantitative version of the upper bound on P{n, e, 3) in Theorem [TJb). 

Theorem 4 Every graph G on n vertices can be edge partitioned E = Eq U Ei L) . . . L) Ei^ such that 
\Eo\ < en'^, k < 50e~^, and for 1 <i <k, the diameter of Ei is at most 3. 

Proof: We repeatedly use Lemma H] to pull out subgraphs of diameter at most 3 until the remaining 
subgraph has at most en^ edges. The remaining at most en^ edges make up Eq. If the current graph 
has at least m/2 edges, then by the above lemma we can find a subgraph of diameter at most 3 with at 
least ^^2n^ edges. Therefore, after pulling out s = (m — m/2)/ ^^^^l = 128n^/m^ such subgraphs of 
diameter at most 3, we remain with at most m/2 edges. Similarly, applying this process to a subgraph 
with at most 2*en^ edges we get a subgraph with at most 2*~^en^ edges, after pulling out at most 
~ 2'^~^'*e~^ subgraphs of diameter 3. Summing over all i > 1, we obtain that altogether we pull 
out at most YliLi 2'^~^V-2 = ^£-2 ^ 50^-2 subgraphs. □ 

Next we establish a lower bound on P{n, e, 3), using the following two lemmas. 

Lemma 5 Almost surely the subgraph of diameter at most 3 of the random bipartite graph G{n,n,p) 
with p = with the maximum number of edges has (1 + o{l)){2pn + p^n^) = (|| + o(l))y^ edges. 

The neighborhood N{v) of a vertex v m. a, graph G is the set of vertices adjacent to v. To see that 
there is almost surely a subgraph of diameter 3 of G{n,n,p) with (1 + o{l)){2pn + p^n^) edges, let a 
and b be adjacent vertices in different vertex classes of G{n,n,p), and consider the induced subgraph 
H with vertex set N{a) U N(b), which has diameter at most 3. It follows from Chernoff's bound 
for the binomial distribution that almost surely all vertices of G{n,n,p) have degree (1 + o{l))pn. 
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Hence, the number of edges containing a or 6 is (1 + o(l))2pn. Furthermore, given |A^(a)| and |A^(6)|, 
the number of edges between N{a) \ {b} and N(b) \ {a} follows a binomial distribution, and another 
application of Chernoff 's bound for the binomial distribution implies that the number of these edges 
is (1 + o(l))p^n^. Hence H has (1 + o{l)){2pn + p^n^) edges. To prove Lemma El it thus suffices to 
show that every diameter 3 subgraph has at most (1 + o(l))(2pn + p^n^) edges. 

Since almost surely the number of edges of G{n,n,p), p = is concentrated around it expected 
value pn'^, we have the following corollary. For all sufficiently large n, there is a bipartite graph on 2n 
vertices with at least |?t,^/^ edges in which any diameter at most 3 subgraph has at most fra^/^ edges. 

For a graph G, the blow-up G{r) denotes the graph formed by replacing each vertex Vi of G by an 
independent set Vi of size r, where vertices u £ Vi and w £ Vj are adjacent in G{r) if and only if Vi 
and Vj are adjacent in G. 

Lemma 6 // any subgraph of a graph G with diameter at most d has at most m edges, then any 
subgraph of G{r) with diameter at most d has at most r'^m edges. 

Proof: Let H he a. subgraph of G{r) of diameter at most d. Let H' be the induced subgraph of G 
where Vi is a vertex of H' if there is a vertex of in V^. It is clear from the definition of G{r) that 
the diameter of H' is at most the diameter of H. Thus H' also has diameter at most d and so it has 
at most m edges. Then H has at most r^m edges, since for every edge {vi,Vj) of H' there are at most 
edges ofHmVixVj. □ 

^From Lemmas [S] and El we quickly deduce a lower bound on P{n, e, 3). Of course, we are assuming 
here that e < 1/2 as otherwise en^ > (2) and P(n, e, 3) = since we can let Eq consist of all edges of 
the graph. 

Theorem 5 We have P{n, e, 3) > ce~^ for some absolute constant c > 0. 

Proof: Let t = (40e)~^ and r = By choosing an appropriate constant c we may suppose that e 
is sufficiently small and so t is sufficiently large. Therefore, by Lemma O there is a bipartite graph 
G with 2t vertices, at least ^t'^^^ edges such that every diameter 3 subgraph of G has at most 
edges. The blow-up graph G{r) has n vertices, at least if^/^r^ = 2en^ edges, and Lemma E] shows 
that any subgraph of G{r) with diameter at most 3 has at most |r^t^/^ = t~^/^n^/5 = 8(40)^e^n^ 
edges. Thus P{n,e,3) > §(49^)" ^3^2 ^ 8(40)^ which completes the proof. □ 

We next include a simple characterization of bipartite graphs of diameter at most 3. 

Proposition 1 A bipartite graph G with at least three vertices has diameter at most 3 if and only if 
each pair of vertices in the same vertex class have a common neighbor. 

Proof: Let A and B be the vertex classes of bipartite G with \ A\ > \B\. If a pair of vertices in the 
same vertex class does not have a common neighbor, then the shortest path between them must be 
even, have length at least four, and hence G has diameter at least four. 

Conversely, suppose each pair of vertices in the same vertex class have a common neighbor. If two 
vertices are in the same vertex class, then there is a path of length two between them. If a £ A and 
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b G B, then there is another vertex a' G A, and hence a and a' have a neighbor b' in S. If b' = b, then 
there is a path of length one between a and b. Otherwise, b' and 6 have a common neighbor, so there 
is a path between a and 5 of length at most three. This shows that the distance between any pair of 
vertices of G is at most 3, i.e., G has diameter at most 3. □ 

Notice that a bipartite graph with diameter less than 3 must be a complete bipartite graph, as if 
there is a pair of vertices in different classes that are not adjacent, then the shortest path between 
them is of odd length greater than 1, and so the diameter is at least 3. Hence a bipartite graph with 
at least three vertices has diameter exactly three if and only if it is not a complete bipartite graph 
and each pair of vertices in the same vertex class have a common neighbor. 

Our goal for the rest of the subsection is to prove Lemma O We will assume that p = , n is 
sufficiently large, and let A and B denote the vertex sets of size n of G{n, n,p). We first need to collect 
several basic lemmas about the edge distribution in G{n,n,p). 

Lemma 7 G{n,n,p) almost surely has the following six properties. 

(a) Every vertex has degree (1 + o(l))pn. 

(b ) Every pair of vertices have at most log n common neighbors. 

(c) For every edge {a,b), there are (1 + o{l)){2pn + p^n'^) edges between N{a) and N{b). 

(d) For all a G A and b G B non-adjacent, there are (1 + o(l))p'^n^ edges between N{a) and N{b). 

(e) For allYcB and X C A, there are at most \X\ + |yp log?i edges between X and Y. 

(f) For all A' C A and B' C B, there are at most t = 6inax(\A'\\B'\p, {\A'\ + \B'\) logn) edges between 
A' and B' . 



Proof: (a) The degree of each of the 2n vertices of G{n,n,p) follows a binomial distribution. Cher- 
noff 's bound for the binomial distribution implies that almost surely all of the degrees are concentrated 
around their expected value, pn. 

(b) The probability that a given pair of vertices in the same part have at least log n common neighbors 
is at most (iog„)p^'°^" < n^°^"(4y^)^^^°§" = There are 2(2) < pairs of vertices in the same 
class of G{n,n,p), so the probability G{n,n,p) has a pair of vertices with logn common neighbors is 
at most n~^n'^ = n~^. 

(c) By (a), almost surely, the degree of every vertex in G{n, n,p) is (H-o(l))pn. If a and b are adjacent, 
given |A^(a)| and |iV(6)|, the number of edges between N{a) \ {b} and N{b) \ {a} follows a binomial 
distribution. An application of Chernoff 's bound for the binomial distribution implies that a.s. for each 
edge {a,b) the number of edges between A^(a) and iV(6) is \N{a)\ + \N{b)\-l + {l+o{l))p\N{a)\\N{b)\ = 
(1 + o{l)){2pn+p^n'^). 

(d) Similar to (c), an application of Chernoff 's bound for the binomial distribution implies that almost 
surely for each pair a, b of non-adjacent vertices in different vertex classes, the number of edges between 
N{a) and N{b) is {1 + o{l))p\N {a)\\N {b)\ = {1 + o{l))p^n^ . 

(e) Let xi,. . . ,Xk be the vertices of X with at least two neighbors in Y. Since every dyixi) > 2 and 
(by (b)) each pair of vertices in Y has at most logn common neighbors, we have that 
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This implies that the total number of edges between X and Y is at most \X\ — k + ^^dyixi) < 
\X\ + |yplogn. 

(f) The result is trivial if A' or B' is empty, so we may assume they are nonempty. The number 
of pairs A' C A and B' C B of size \A'\ = a and \B'\ = b is QQ- For fixed A' and B' of 
sizes a and b, respectively, the probability that there are at least t edges between them is at most 

) < (^) < 2"* < n-'5(''+*). So the probability that there are such subsets A' and B' is at most 

Let be a subgraph of the bipartite graph G = G{n,n,p) of diameter at most 3, and X and Y 
be its vertex sets with \X\ > \Y\. In particular, according to Proposition [H each pair of vertices of 
X have a common neighbor in Y, and each pair of vertices of Y have a common neighbor in X. We 
suppose for contradiction that H has more than (1 + o{l)){2pn + p^n'^) edges. Adding extra edges to 
a graph on a given vertex set cannot increase the diameter of the graph, so we may suppose that H 
is the induced subgraph of G with vertex sets X and Y. In graph i7, let di > ^2 > . . . > d|y| be the 
degrees of the vertices of Y in decreasing order, and Vi denote the vertex of degree di. 

We first prove that H has few vertices. 
Claim 1 Almost surely, every diameter 3 subgraph H of G{n,n,p) has at most lOn^/^logn vertices. 

Proof: Suppose for contradiction that H has at least x = lOn^/^logn vertices. Since |X| > 
we have \X\ > x/2 = 5n"^/^logn. Lemma [Tl^f) shows that almost surely there are at most t = 
6max(|X| (|X| + log n) < 6|X| max 2 log n) edges between X and Y. Also, by Lemma 

[7)^a), a.s. the maximum degree satisfies A = (1 + o(l))np. Convexity of the function f{y) = (|) yields 




= (1 + o{l))npt/2 < Anp\X\ max 21ogn) , 



which is an upper bound on the number of pairs of vertices of X that have a common neighbor in 
Y. We have 4np|X| -plXj < ('^') and 4np|X| • 21ogn < ('^'). Hence, there are less than (l"^') pairs 
of vertices in X with a common neighbor in y. So there is a pair of vertices in X with no common 
neighbor in Y, contradicting H has diameter at most 3 and completing the proof. □ 

Lemma mf) together with the previous claim imply that a.s. any subgraph of G(n,n,p) of diameter 
3 with at least 2pn = ■y/n/2 edges has at least j^y^^ vertices and at most W^/nlogn vertices. 

Claim 2 Lei e = There is a vertex in Y that has at least (1 — e)\X\ neighbors in X. 

Proof: Since every pair of vertices of X have a common neighbor in Y we have Y^i^i (2*) — ('2^')' 
where di > d2 > ... > d^y] the degrees of vertices of Y in X. Suppose for contradiction that 
no vertex of Y that has at least (1 — e)\X\ neighbors in X. Let r = By Lemma El^e), 

J2l=i di < l-'^l + r'^ logn. In particular, convexity of the function f{x) = (^) demonstrates that 
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Since H and hence also X has at most lO-^nlogn vertices, we have that < logn. Thus, by 

Lemma[71^f), there are at most 15|X| log n edges from Y to X. This implies that di < dj. < 15|X| log n/r 
for all i > r. Under these constraints, we have by convexity of the function f{x) = (2) that 

E (2O - ^('^'^'2°^"/") < ^|Xplog2n= 120|X|V3iog2n« ^-\X\^/^ = 

i>r V / V / 

This together with the above estimate shows that Yl^^i (2') ^ ('^')' ^ contradiction. □ 

Take e as in Claim [2j Since there is a vertex in Y with at least (1 — e)\X\ neighbors in X and every 
vertex has degree (1 + o{l))pn, then \X\ < (1 + o{l))pn/{l — e) = (1 + o(l))pn. 

We next show that Y is also quite large if H has at least 2pn edges. By Claim [2l there is a vertex 
V £Y adjacent to at least (1 — e)\X\ elements of X. By Lemma[71^b), every other vertex in Y besides v 
has at most logn neighbors in N{v). Hence, there are at most \X\ + \Y\ logn edges between N{v) and 
Y. There are at most e|X| vertices in X\N{v), so there are at most 6 max(e|X| (e|X| + |y|) log n) 
edges between X \ N{v) and Y. Hence the number of edges of H is at most 

\X\ + |y|logn + 6max(e|X||y|p, {e\X\ + |y|)logn) < (1 + o(l))pn + 7|y| logn, 

where we use \X\ < (1 + o(l))pn. If |y| < g^jj^^, we get that there are less than 2pn edges in H, a 

contradiction. To summarize, we have the following inequalities: 3q^„ < 1^1 < |^| < (1 + o{l))pn. 

Now that we have established that X and Y are of similar size, the proof of Claim [2] with X and Y 
switched also gives us the following claim. The only estimate in the proof that needs to be checked 
is that |Xp(logn)^/r <C el^P, and since \X\ and |y| are both of the form 71^/"^+°^^ r = \Y\^/^ and 
e = this clearly holds. 

Claim 3 Let e = \Y\~'^/^ . There is a vertex in X that has at least (1 — e)\Y\ neighbors in Y . 

We now complete the proof of Lemma [5l Let y be a vertex in Y with at least (1 — e)\X\ neighbors 
in X and x be a vertex in X with at least (1 — e)|y| neighbors in Y, where e = Such 
vertices x and y exist by Claims [2] and [3l since \Y\ < \X\. Let Xi be the set of neighbors of y in 
X, and X2 = X \ Xi. Let Yi denote the set of neighbors of x in Y, and Y2 = Y \ Yi. By Lemma 
nic) if X and y are adjacent and Lemma [71[a) and (d) if x and y are not adjacent, there are at most 
(1 + o(l))(2pn + p^n'^) edges between Xi U {x} and Yi U {y}. Since Xi consists of neighbors of y, 
by Lemma [Tl^b), each vertex in Y2 \ {y} has at most logn- neighbors in Xi. Similarly, each vertex in 
X2 \ {x} has at most logn neighbors in Yi. Lemma [7]^f) implies that the number of edges between X2 
and Y2 is at most 

6max(|X2||y2|p, (|^2| + |Y2|)logn) < 6max(e|X|e|y|p, {e\X\ +e|y|)logn) < 20e|X|logn = o(n^/2)_ 

Putting these inequalities altogether, the number of edges between X and Y, and hence the number 
of edges of -ff , is at most 

(1 + o{l)){2pn + p^n^) + |X2|logn+ |y2| log n + ©(n^/^) = (1 + o(l))(2pn + p^n^), 

where we use \X2\ < e|X| = o(n-^/^) and II2I ^ e|Y| = o{n^^'^), which completes the proof. □ 
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2.3 Decomposing a graph into diameter 4 subgraphs 

In this subsection, we prove the last claim of Theorem[Tl that P{n,€,4:) = Q{l/e) for e > 1/n. 

Definition 5 For a vertex v and graph G, let Nj.[v) he those vertices of G which are within distance 
at most r from vertex v and Gr{v) denote the induced subgraph of G with vertex set Nr{v). 

The graph Gr{v) of course has radius at most r and hence diameter at most 2r. Note that Gr{v) = 
G2r{v,v) (defined in the previous section) as any vertex at distance at most r from v is contained in 
a walk from w to w of length at most 2r, and any walk from w to v of length at most 2r contains only 
vertices of distance at most r from v. 

To bound P{n, e, 4) from above we use the following lemma, which is tight apart from the constant 
factor as demonstrated by a disjoint union of cliques of equal size (see the proof of Lemma [9] below) . 

Lemma 8 If a graph G has n vertices and at least m > An edges, then it has a subgraph with diameter 

2 

at most 4 and at least edges. 

Proof: Delete vertices one by one of degree at most The resulting induced subgraph G' has 
at least m — = m/2 edges and minimum degree at least For any vertex v of C, ^2(1') has 
diameter at most 4 and at least (|^)^/2 > |^ edges as v and its neighbors have degree at least □ 

We now prove a quantitative version of the upper bound on P(n, e,4) in Theorem [T|^c). 

Theorem 6 Every graph G on n vertices can he edge partitioned E = Eq U Ei U . . . U Ei such that 
\Eo\ < en?, i < lQe~^ , and for I <i < i, the diameter of Ei is at most 4. 

Proof: We repeatedly use Lemma [8] to pull out subgraphs of diameter at most 4 until the remaining 
subgraph has at most en^ edges. The remaining at most en^ edges make up Eq. If the current graph 
has at least m/2 edges, then by the above lemma we can pull out a subgraph of diameter at most 4 with 
at least (m/2)^/8n^ edges. Therefore, after pulling out s = (m — m/2)/ ^"^^a^ = 16n^/m subgraphs 
of diameter at most 4 from our graph, we remain with at most m/2 edges. Similarly, applying this 
process to a graph with at most 2*en^ edges we get a subgraph with at most 2*~^en^ edges, after we 
pull out at most = 2^~*e~^ subgraphs of diameter 4. Summing over all i > 1, the total number 
i of subgraphs of diameter at most 4 we pull out is at most Yli^i 2^"*e^^ = 16e~^. □ 

The next lemma shows that Theorem [6] is tight apart from a constant factor. 
Lemma 9 // ^ < e < ^ , then P{n, e, d) > 

Proof: Let t = ^ (note that t < n/2) and let G be a graph consisting of n vertices partitioned into 
t disjoint cliques each of size n/t. The graph G has ^("2*) edges. Any connected subgraph of G has 
("/*) edges. Hence, any partition E = Eq U Ei U . . . U Eg such that |£^o| < en? < ^('"a*) and each Ei 
is connected for 1 < i < £ must have £ > t/2 = □ 
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3 Covering graphs of large minimum degree 



In this section, we prove results on the minimum number of subgraphs needed to cover all the edges of 
a graph of large minimum degree by low diameter subgraphs. First we show, using a simple sampling 
argument, that the edges of every graph with n vertices and minimum degree linear in n can be covered 
by O(logn) subgraphs of diameter at most 3. 

Theorem 7 Let G = {V,E) he a graph on n vertices with minimum degree at least en. Then there is 
a covering E = Ei U . . . U Ei of the edge set of G such that I = 2e~^ log n and for \ < i < I, Ei has 
diameter at most 3. 

Proof: Pick i pairs {vi,Wi) of not necessarily distinct vertices uniformly at random with repetition. 
If there is a path of length at most 3 between Vi and Wi, let G^{vi,Wi) be the induced subgraph of 
G as in Definition [J] and Ei be the edge set of Gs{vi,Wi). By Lemma [3l G^{vi,Wi) has diameter at 
most 3. Let e = {v,w) be an edge of G. If Vi S {v} U N{v) and Wi G {w} U N{w), then Gz{vi,Wi) 
contains e as there is a walk from Vi to Wi of length at most 3 containing e. So the probability 
that a given edge e = of G is in G-s{vi,Wi) is at least ^-^^^^ > g2_ gjj^fjg -^g ^re picking 

£ = 2e~^logn pairs of vertices uniformly at random, the probability e is in none of these subgraphs 
is at most (1 — e'^Y < e~^^°s" = n~^. Summing over all edges of G, the expected number of edges of 
G that are not in any of the Ei is at most (2)^^"^ < 1/2- Hence, there is a choice of £"1, ... , E^, each 
having diameter at most 3, that together cover all edges of G. This completes the proof. □ 

The family of all subsets of [n] = {1, . . . , n} of size k is denoted by ('^^) . The Kneser graph KG(?i, k) 
has vertex set (^^'), where two sets of size k are adjacent if they are disjoint. A famous theorem 
of Lovasz |10j . who proved Kneser's conjecture using topological methods, states that the chromatic 
number of KG(n, k) is n — 2fe + 2 for n > 2k > 2. We use this property of Kneser graphs to construct a 
graph demonstrating that the bound in Theorem [7| is tight up to a constant factor. This will complete 
the proof of Theorem [2][a) . 

Theorem 8 For every sufficiently large N, there is a graph G on N vertices with minimum degree at 
least 2''^N whose edges cannot be covered by ^ log2 N subgraphs of diameter at most 4. 

We establish Theorem [8] using the following lemma. 

Lemma 10 Let = Then there is a graph F^ on vertices with minimum degree afc/16 such 

that any covering of the edges of Fk by subgraphs of diameter at most 4 uses at least 2k + 2 subgraphs. 

To deduce Theorem [5] from Lemma [TUl we let k be the largest positive integer such that < N. 
If Ofc is not exactly N, we can duplicate some vertices of Fk if necessary to get the desired graph G 
with N vertices. Indeed, as A'^ < Ofc+i < lOa^ and = 4(^^) < 2^^, we have that G has minimum 
degree at least a^/lG > 2~^N and the edges of G cannot be covered by less than 2k + 2 > ^ log2 N 
subgraphs of diameter at most 4. It thus suffices to prove Lemma [Till 

The incidence graph IG(n, k) is a bipartite graph with first vertex class (^^') and second vertex class 
[n], where i G [n] is adjacent to S" C ('^^) if i E S. Every vertex in the first vertex class has k neighbors, 
and every vertex in the second vertex class has {^Z]) = Ti.iX) neighbors. Two vertices 81,82 S (^^') 
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have a common neighbor in IG(n, k) if and only if they have nonempty intersection. So any coloring 
of ('^') such that any pair 81,82 of the same color have a neighbor in common in lG{n,k) gives a 
proper vertex coloring of KG(n, fc). Since IG{n,k) is bipartite, the number of colors of any coloring 
of ('^^) in which any pair 81, 82 of the same color have distance less than 4 in IG(n, k) is at least the 
chromatic number of KG(n, k), which is n — 2fc + 2. 

Let H(n,A;,t) denote the graph with first vertex class (^^') and second vertex class [nt], where the 
bipartite graph between ('^^) and {n{j — 1) + 1, . . . ,nj} make a copy of IG{n,k) for I < j < t. By 
construction, any coloring of (j-^) in which any pair 81,82 of the same color have distance less than 

4 in H(n,/c,t) uses at least n — 2k + 2 colors. Let t = ^(^^) and i/^ = H^Ak, k,t). The number of 
vertices of is 0^/2, and every vertex has degree = afe/16. We have established the following 
lemma. 

Lemma 11 The bipartite graph is a^/lG-regular with afc/2 vertices, and every coloring of the first 
vertex class of Hk such that every pair of vertices of the same color have distance less than 4 uses at 
least 2k + 2 colors. 

Let Fk be the graph consisting of two disjoint copies H^,, of Hk with an edge between the two 
copies 8^ and 8^ of 8 for each vertex 8 G (j-^) in the first vertex class of Hj^. Notice that Fj. has 
2\Hk\ = vertices and has minimum degree afc/16. To complete the proof of Lemma [TOl and hence 
of Theorem [HI it suffices to show that in any covering E{Fk) = Ei\J . . .VJ Ei va. which each Ei has 
diameter at most 4, the number i of subgraphs used is at least 2k + 2. Given such an edge-covering of 
Fk, define a coloring x ■ {^^) — * M as follows. Let x{S) = i if i is the smallest positive integer such 
that the edge between the two copies of 8 in F^ is in Ei. 

The key observation is that if x{S) = x{T) = i and the distance between 8 and T is at least four in 
Hk, then the distance between 8^ and is at least 5 in i^^, which contradicts that Ei has diameter 
at most 4. Indeed, in any path from 8^ to T^, one of the edges between the two copies of Hj- must 
be used, as well as at least four edges inside copies of H^. Thus this coloring satisfies conditions of 
Lemma [TT] and therefore x uses at least £> 2k + 2 colors. This completes the proof of Lemma [TOl and 
of Theorem [8l □ 

We next show how to cover all the edges of a graph of large minimum degree by a small number of 
subgraphs each of diameter at most 5. Recall from Section [2^2] that Gd{v,w) is the induced subgraph 
of G consisting of all vertices on a walk from to w; of length at most d. Lemma [3] states that Gd{v, w) 
has diameter at most d. As in Section [2.31 Nr{v) denotes all the vertices of G within distance at most 
r from v and Gr{v) is a induced subgraph of G with vertex set Nr{v). 

Lemma 12 We have Q{n,e,5) < e^^. That is, every graph G with n vertices and minimum degree at 
least en has an edge covering E = EiU . . .U E^ with each Ei having diameter at most 5 and £ < e^^. 

Proof: Let {vi, . . . , vt} be a maximal set of vertices in G of distance more than 2 apart from each 
other. By definition, every vertex of G has distance at most 2 from one of these vertices. It follows 
that every edge of G has both its vertices in some N2{vi) or one vertex in a set N2{vi) and the other 
vertex in a different set [vj ) . Note that in the latter case the edge lies in a walk of length at most 

5 from Vi to Vj. For each pair Vi,Vj of distance at most 5, we will use the subgraph G^{vi,Vj). Also, 
we will use the subgraph G2{vi) for each i. The number I of subgraphs we use is at most i + (2) 
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and every edge is in at least one of these diameter at most 5 subgraphs. Since Ni{vi), . . . , Ni{vt) are 
disjoint sets of vertices, each of size at least 1 + en, then t < e^^ and hence £ < e~^. □ 

We now present a lower bound for Q{n, e, 5). 

Theorem 9 There are positive constants c and d such that Q{n,e,5) > ce~^ for e > c'(logn)~^/^. 

Let k be minimum positive integer such that d := n/a^ < 2^^e^^ with = ^{^^)- The assumption 
e > (/(log n)"'^/^ in Theorem [U] and the fact that grows exponentially in k implies that k > d^ for 
some appropriately chosen constant c'. Since Ok+i < lOofc, we have d > 10- 2^'^e~^. We will construct 
a graph G on n vertices with minimum degree at least en such that any covering of the edges of G by 
subgraphs of diameter at most 5 uses at least d? > 2~^^e~^ subgraphs, which implies Theorem [9l 

Let iJfc be the bipartite graph as defined before Lemma [TTl By Lemma [TH is Ofc/lG-regular with 
afc/2 vertices, and every coloring of the first vertex class of Hk such that every pair of vertices of the 
same color have distance less than 4 uses at least 2k + 2 colors. Let F be the graph consisting of d 
disjoint copies of with no edges between them. The graph F is bipartite, and we call the union 
of the d copies of the first vertex class of the first vertex class of F, and the remaining vertices 
the second vertex class of F. Let G be the graph consisting of two disjoint copies Fi, F2 of F, with a 
certain matching between the first vertex class of Fi and the first vertex class of F2, which we define 
in the next paragraph. 

Recall that the first vertex class of Hf^ is the vertex set of the Kneser graph KG(4A;,fc), and a 
pair of vertices in the first vertex class of Hf^ have distance less than 4 if and only if they are not 
adjacent in KG(4A;, /c). Let (^^^^) = Ui U . . . U Ud he a partition of the vertex set of the Kneser 
graph KG(4/c, k) into d sets such that for 1 < i < d the induced subgraph of KG(4/c, k) with vertex 
set Ui has chromatic number at least [2k /d\ > 2d^. Such a partition exists since for any partition 
x(KG(4A;, k)) = pi + ■ ■ ■ + oi the chromatic number of a graph KG(4/c, k) into nonnegative integers, 
there is a partition Ui U . . . U Ud of the vertex set into subsets such that the induced subgraph of 
KG{4:k,k) with vertex set Ui has chromatic number pi. Indeed, we can take Ui to be the union ol pi 
color classes in a proper coloring of KG(4A;, fc) with x(KG(4A;, fc)) colors. For j G {1,2}, 1 < i < d, 
and S C [4/c] with \S\ = k, let Sij be the copy of vertex S in the ith copy of Hf^ in Fj. Suppose 
that S € Uf,, then the matching between the first vertex class of Fi and the first vertex class of F2 is 
defined by Si^i is adjacent to 5^+6^2; where i + 6 is taken modulo d. We denote by Aij and Bij the 
first vertex class and the second vertex class, respectively, of the ith. copy of in Fj. 

By definition, graph G has 2d • ak/2 = dok = n vertices and minimum degree at least afe/16 > en. 
Hence, Theorem [5] follows from the next lemma. 

Lemma 13 Any edge covering of G by diameter at most 5 subgraphs uses at least d'^ subgraphs. 

Before proving Lemma [T3l we first establish some properties of distances between vertices in G. 

Claim 4 If V £ Aij and v' G Ai'j with i 7^ i' , then v and v' have distance at least 4 in G. 

Let V G Aij and v' S Ai'j be the closest pair of vertices in these two sets. To verify the claim, let 
V = vi,V2, ■ . . ,vt = v' be a shortest path from v to v' . The vertex V2, being a neighbor of v, must 
be in Bij or in the first vertex class of i^s-j- If V2 S Bij, then ^3 G Aij and we could instead start 
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from U3 and get to v' in a shorter path, a contradiction. Therefore, V2 is in the first vertex class of 
F^-j. Similarly, we must have Vf-i is in the first vertex class of -Fs-j- We have V2 7^ ft-i since the 
edge between v and V2 is one of the edges of the matching, and the edge between vt-i and v' is one 
of the edges of the matching, which would otherwise imply that V2 is in two edges of a matching, a 
contradiction. We have ^ v as otherwise we would get a shorter path from v to v' . This implies 
that f3, being a neighbor of V2, lies in the second vertex class of i^s-j- Hence t — 1 > 4 and the distance 
between v and v' is at least 4. 

Claim 5 If V £ Bij and w G Bi/ j with i 7^ i' , then v and w have distance at least 6 in G. 

Indeed, consider a shortest path from v to w. The second vertex of this path is in Aij as all the 
neighbors of v lie in and the second to last vertex of this path is in Aii^j as it is adjacent to w. 
By Claim m any vertex in Ai^j has distance at least 4 from any vertex in ^j'j, therefore it follows that 
V and w have distance at least 6. 

Proof of Lemma I13t Suppose for contradiction that there are r < d'^ subgraphs Gi, . . . ,Gr of G 
each of diameter at most 5 which cover the edges of G. It follows from Claim [5] that any diameter at 
most 5 subgraph of G cannot contain a vertex in Bij and also a vertex in -Bi'j- with i 7^ i' . Hence, 
for each h, 1 < h < r, there is at most one pair (i, i') such that Gh contains both a vertex of Bi i and 
a vertex of -Bi'.2- Since r < d'^, the pigeonhole principle implies that there is a pair such that 

no Gh contains a vertex of Bi i together with a vertex of Fix such a pair {i,i')- Let Vh denote 

the collection of all sets S G (^^/f^) such that vertices Si^i and Si'^2 form an edge of the matching that 
belongs to Gh- By definition of such an edge, the set S is in C/j/_j, where the subscript is taken modulo 
d. Recall that the induced subgraph of KG(4A;, k) with vertex set ?7j'_j has chromatic number at least 
2d^. We use the following claim. 

Claim 6 If S,S' are vertices in the first vertex class of Hi;., then the distance between Sij and S[ ^ in 
G is the distance between S and S' in Hj.. 

Indeed, Sj j- and S[ j belong to a copy of if^ in G, so their distance in G is at most the distance of 
S and S' in H/^. In the other direction, it is easy to see that any path in G from Sij to 5- ^ can be 
projected onto a path which is not longer from S to 5' in Hk, which verifies Claim [6l 

The next claim completes the proof of Lemma [T3l Indeed, since Gi, . . . ,Gr cover the edges of G, we 
must have ?7i'_j = Fi U . . . U V^. Claim [7] implies that each Vh forms an independent set in KG(4fc, k) 
and hence r > 2d^. 

Claim 7 Each pair of vertices in Vh have distance less than 4 in H^, i.e., Vh forms an independent 
set in KG{4:k, k). 

To prove Claim [71 suppose for contradiction that S*, S" G V/^ have distance at least 4 in H^^. Without 
loss of generality, suppose Gh contains no vertex in Bi i (the other case in which Gh contains no vertex 
in Bii ^2 can be treated similarly). We claim that the distance between Si^i and S'^-^ in Gh is at least 
6. Indeed, the only vertex adjacent to Si^i in Gh is 5j/^2 and the only vertex adjacent to S[ ^ in Gh is 
S[, 2- Since S and 5' have distance at least 4 in iJ^, by Claim [6l 5j/^2 and S[i 2 have distance at least 4 
in G and hence also in Gh- Therefore, Si^i and Si' ^2 have distance at least 6 in G/i, contradicting Gh 
has diameter at most 5. □ 
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We next show that Q(n,e,Q) = 0(e ^) for e > 1/n. The lower bound follows by considering a 
disjoint union of cliques, each with at least en + 1 vertices. The upper bound also has a simple proof. 

Lemma 14 Every graph G with n vertices and minimum degree at least en has an edge covering 
E = EiU . . .U El with each Ei having diameter at most 6 and I < . 

Proof: Let {vi, . . . , ve} be a maximal set of vertices in G of distance more than 2 apart from each 
other. Then every vertex of G has distance at most 2 from one of these vertices and therefore every 
edge of G has both of its endpoints within distance at most 3 from some Vi. This implies that the I 
subgraphs G^{vi), . . . , G^{v() cover all the edges of G and each of them has diameter at most 6. Since 
Ni{vi), . . . , Ni{vi) are disjoint sets of vertices of size at least 1 + en, then £ < e~^. □ 

4 Decomposing hypergraphs into low diameter subgraphs 

We start this section by showing how to decompose the edge set of a /c-uniform hypergraph on n 
vertices, apart from at most en^ edges, into a small number of subhypergraphs of diameter at most 
3. This will establish an upper bound on Pfc(n,e, 3). We then present two constructions giving lower 
bounds on Pfc(n, e, d). 

Let denote the following /c-uniform hypergraph on 2k vertices with k + 2 edges. Its vertex set 
consist of two disjoint /c-sets V = {vi}^^^ and W = {wi}^^i. The sets V and W are edges of H^. For 
1 < i < k, vertex Wi together with all vertices Vj,j ^ i also form an edge of H^. 

Let G be a /c-uniform hypergraph and let e = {vi, . . . ,Vk} be a fixed edge of G. Consider all the 
vertices of G which are contained in some edge of G which intersects e in at least k — 1 vertices. Let 
G(e) be the subhypergraph of G induced by this set. Since the intersection of e with itself has size k, 
by definition, all the vertices of e are in G(e). Moreover e is an edge of G{e) as well. 

Lemma 15 For each edge e of a k-uniform hypergraph G, the diameter of G{e) is at most 3. 

Proof: Suppose a,b are distinct vertices of G{e). Then there are two indices 1 < i, j < k such that 
{a} U (e \ {vi}) and {5} U (e \ {vj}) are both edges of G{e). If i = j, then the sequence a, followed by 
all elements of e\vi, followed by 6 is a path of length 2. li i ^ j, and a = Vi or b = Vj , then a and b 
are in an edge of G{e). If i ^ j and neither a = Vi nor b = vj, then the sequence with first element a, 
followed by Vj, followed by all vertices of e\{vi,Vj}, followed by Vi and finally by 6 is a path of length 
3. In any case, the distance from a to b, and hence the diameter of G(e), is at most 3. □ 

The number of edges of G{e) is at least the number of copies of in G for which the image of set 
V is fixed as e. Indeed, if two disjoint edges e and / (together with some other edges of G) form a 
copy of H'^ with / being the image of W, then each vertex in / is in G(e), and so / is an edge of G(e). 

The edge density of a /c-uniform hypergraph is the fraction of subsets of vertices of size k which are 
edges. Let K{t; k) denote the complete A;-partite /c-uniform hypergraph with parts of size t, whose 
edges are all the /c-sets which intersect every part in one vertex. This hypergraph has kt vertices and 
t^ edges. The following well known lemma is proved by a straightforward counting argument and 
induction on the uniformity k (see (7j)- 
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Lemma 16 Fix positive integers k and t. If G is a k-uniform hypergraph with n vertices and edge 
density e with e ^ n^* , then G contains 0(e* n^'*) labeled copies of K{t;k). 

Let G be a /c-uniform hypergraph with n vertices and edge density e. Since is a subhypergraph 
of K{2;k), the above lemma imphes that G contains r2(e^ n^'^) labeled copies of K{2;k) and hence 
of H^. Therefore, there is an edge e for which G{e) contains at least r2(e^'°n^'^/e(^)/c!) = r2(e^^^''n'^') 
edges. By Lemma \T5\ this subhypergraph has diameter at most 3. Now, as we already did in the 
previous sections, we can use the above fact to pull out from G subhypergraphs of diameter 3 with 
many edges until there are at most en'^ edges left. Using a very similar computation as in the proofs 
of Theorems U] and [6l we obtain Theorem Elthat Pk{n,e,3) = 0{€^-^') for e > n'^ . 

We next give a lower bound on Pk{n, e, 3) which we conjecture is tight apart from the constant factor. 

Theorem 10 We have Pfc(n,e,3) > c^e"'^ for G'f.n"^^'^ < e < Gk, where Ck, Gk and are positive 
constants depending only on k. 

We prove this theorem by showing that there is a dense fc-uniform hypergraph with no large subhy- 
pergraph of diameter at most 3. More precisely, the next lemma shows that there is a hypergraph with 
n vertices, at least 2en^ edges, and every subhypergraph of diameter at most 3 has at most c'^^e^'^^n^ 
edges. Hence Pk{n,e,3) > (en'^)/(c^^e^"'"^n^) = c^e"^. 

Lemma 17 For each integer k > 2, there are positive constants Ck, Ck and G'f^ such that the following 
holds. For all sufficiently large n and e satisfying G'f,n~^^^ ^ €. < Gk, there is a hypergraph H on at 
most n vertices with at least 2en^ edges such that every subhypergraph of H with diameter at most 3 
has at most c'^^^e^^^n^ edges. 

Proof: The proof is by induction on k. We have already established the base case /c = 2 of this lemma 
in the proof of Theorem O Let Gk+i = 2-^-^Gk and let e < Gk+i- Fix 5 = 2^+^e and N = n/2. Our 
induction hypothesis implies that there is a /c-uniform hypergraph G with vertices and 25N^ edges 
such that every diameter at most 3 subhypergraph of G has at most c'^^5^~^^N^ edges. So G has edge 
density a = 25N''/{^) < 4k\6. 

Let Gi, . . . ,Gt he t = ^ random copies of G on the same vertex set [N], where Gi is formed 
by considering a random bijection of V(G) to [A^] picked independently of all the other Gj. The 
probability that a given A;-tuple contained in [A^] is an edge of at least one of these t copies of G is 

1 - (1 - a)* > 1 - e-°* = 1 - > 1/2. 

By linearity of expectation, the expected number of edges which are contained in at least one of the 
t copies of G is at least ^ (^) . So we may pick Gi, . . . ,Gt such that at least ^ (^) of the fc-tuples 
contained in [A^] are in at least one of these t copies of G. 

If an edge e is in multiple copies of G, arbitrarily delete it from all Gi that contain it except one. Let 
G'^ be the resulting subhypergraph of Gi. Introduce new vertex sets Vi, . . . ,Vt, each of size N/t, and 
define a (A: -|- l)-uniform hypergraph H on [A^] U U . . . U Vt as follows. The edges are those {k + l)-sets 
which for some i contain an edge of G[ together with a vertex of Vi. The number of vertices of H is 
2A^ = n and the number of edges is at least 
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Note that every edge of H has exactly one vertex in some Vi. Let H' be a diameter at most 3 
subhypergraph of H. Since the G'^ have distinct edges, the vertex set of any strongly connected 
subhypergraph of H, and hence of H' , intersects at most one Vi. Therefore, there is an i such that 
V{H') cViU [N] and E{H') C Vi x E{G'i). Since each edge of H' has exactly one vertex in Vi, note 
that if we delete from any tight path in H' all the vertices of Vi we obtain a tight path in G'- . Therefore 
the subhypergraph of G'^ whose edges are those which are subsets of edges of H' also has diameter at 
most 3. Since G'^ is a subhypergraph of a copy of G, any diameter 3 subhypergraph of has at most 
c^^d^^^N^ edges. Thus, using that n = 2N and 1/t = a/2 < 2kl6, we obtain that H' has at most 

iVilc^^S^'+^N^ = {N/ty-H'^+^N^ < 2k\c;H''+^N^+^ = 2^'+^^+^k\c~^\^+^n''+^ 

edges. Letting Ck+i = 2~^^~^^~^Ck/kl completes the proof by induction. □ 

Essentially the same proof gives the following lemma. The only difference is the base case A: = 2 of 
the induction is given by taking to be a disjoint union of cliques of equal size. Call a /c-uniform 
hypergraph strongly connected if there is a tight path between any two vertices in the hypergraph, i.e., 
the diameter of the hypergraph is finite. 

Lemma 18 For each integer k > 2, there are constants Ck,Ck and G'^ such that the following holds. 
For all sufficiently large n and e satisfying C'ijn~^ < e < Gfc, there is a hypergraph H on n vertices 
with 2en^ edges such that every strongly connected subhypergraph of H has at most c'^^e^n^ edges. 

The hypergraph H in the previous lemma demonstrates the following corollary. 

Corollary 1 Pjt{n,e,d) > c^e^"*^ for G'f^n~^ < e < Gfc and any d. 

5 Concluding remarks 

In the previous section, we establish both upper and lower bounds on Pk{n, e, d). We think the lower 
bounds in Theorem [10] is best possible up to a constant factor, i.e., all but at most en^ edges of every 
n-vertex /c-uniform hypergraph can be partitioned into 0{e~^) subhypergraphs of diameter 3. We 
also believe that Corollary [T] is tight and for each integer A; > 3 there is another integer d(k) such 
that Pk{n,e,d{k)) = 0{e^-^). Note that Theorem □ shows that both our conjectures hold for graphs 
{k = 2). 

Improving our upper bound on Pfc(n, e,3) would also be interesting. One possible way to do so is 
to show that there are many copies of hypergraph in every fc-uniform hypergraph on n vertices 
with edge density e. This problem is closely related to the well know conjectures of Simonovits [H] 
and Sidorenko [13], which suggest that for any bipartite graph the number of its copies in any 
graph G on n vertices and edge density e (e > n~"'^^^) is asymptotically at least the same as in the 
n-vertex random graph with the same edge density. So far it is known only in very special cases, i.e., 
for complete bipartite graphs, trees, even cycles (see [l3]), and recently for cubes [8]. It is tempting 
to conjecture that an analogous statement holds for A:-uniform /c-partite hypergraphs (A: > 3). A k- 
uniform hypergraph is k-partite if there is a partition of the vertex set into k parts such that each edge 
has exactly one vertex in every part. Simonovits-Sidorenko conjecture for hypergraphs would say that 
for any A;-partite fc-uniform hypergraph H, the number of its copies in any A;-uniform hypergraph on n 
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vertices with edge density e (e > n~^^^')) is asymptotically at least the same as in the random n- vertex 
/c-uniform hypergraph with edge density e. It holds for complete fc-partite /c-uniform H (Lemma 1161 is 
essentially the case when all parts of H have equal size). However, as shown by Sidorenko [T3|, this is 
false in general. Nevertheless, it is still an intriguing open problem to accurately estimate the minimum 
number of copies of a fixed hypergraph H that have to appear in every A;-uniform hypergraph on n 
vertices with edge density e. 

References 

[I] N. Alon, A. Gyarfas and M. Ruszinko, Decreasing the diameter of bounded degree graphs, J. 
Graph Theory 35 (2000), 161-172. 

[2] F. Chung and M. Garey, Diameter bounds for altered graphs, J. Graph Theory 8 (1984), 511-534. 

[3] P. Erdos, A. Gyarfas and M. Ruszinko, How to decrease the diameter of triangle-free graphs, 
Combinatorica 18 (1998), 493-501. 

[4] P. Erdos, J. Pach, R. Pollack, and Z. Tuza, Radius, diameter, and minimum degree, J. Combin 
Theory B 47 (1989) 73-79. 

[5] P. Erdos and A. Renyi, On a problem in the Theory of Graphs (in Hungarian), Publ Math Inst 
Hungar Acad Sci 7 (1962), 623-641. 

[6] P. Erdos, A. Renyi, and V. T. Sos, On a problem of Graph Theory, Studia Scientarium Mathe- 
maticarum Hungar 1 (1966), 215-235. 

[7] P. Erdos and M. Simonovits, Supersaturated graphs and hypergraphs, Combinatorica 3 (1983) 
181-192. 

[8] H. Hatami, Graph norms and Sidorenko's conjecture, to appear in Israel J. Math. 

[9] N. Linial and M. Saks, Low diameter graph decompositions, Combinatorica 13 (1993), 441-454. 

[10] L. Lovasz, Kneser's conjecture, chromatic number, and homotopy, J. Combin. Theory A, 25 
(1978), 319 

[II] J. Polcyn, V. Rodl, A. Rucihski, E. Szemeredi, Short paths in quasi-random triple systems with 
sparse underlying graphs. J. Comb. Theory, Ser. B 96 (2006), 584-607. 

[12] J. Polcyn and A. Rucihski, Short paths in e-regular pairs and small diameter decompositions of 
dense graphs, to appear in Discrete Math. 

[13] A. F. Sidorenko, A correlation inequality for bipartite graphs. Graphs Combin. 9 (1993), 201-204. 

[14] M. Simonovits, Extremal graph problems, degenerate extremal problems and super-saturated 
graphs, in: Progress in graph theory (J. A. Bondy ed.). Academic, New York, 1984, 419-437. 



18 



